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RAW SEQUENCE LISTING DATE: 08/06/1999 

PATENT APPLICATION US/09/017, 715A TIME: 16:13:53 

INPUT SET: S32850.raw 



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



SEQUENCE LISTING 

) General Information: 

(i) APPLICANTS: Ji, Hongjun 

Rosen, Craig A. 

(ii) TITLE OF INVENTION: Breast Cancer Specific Gene 1 

(iii) NUMBER OF SEQUENCES: 12 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Sterne, Kessler, Goldstein & Fox P.L.L.C, 

(B) STREET: 1100 New York Ave., Suite 600 

(C) CITY: Washington 

(D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP: 20005-3934 



(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1,0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 09/017,715 

(B) FILING DATE: 1998-FEB-03 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/673,284 

(B) FILING DATE: 28-JUN-96 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/000,602 

(B) FILING DATE: 30-JUN-95 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US95/08295 

(B) FILING DATE: 30-JUN-95 

(C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA: 
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PAGE: 2 RAW SEQUENCE LISTING DATE: 08/06/ 1 999 

PATENT APPLICATION mi09l017, 715A TIME: 16: 13:53 

INPUT SET: S32850.raw 

47 (A) APPLICATION NUMBER: US 60/037,080 

48 (B) FILING DATE: 03-FEB-97 

4 9 (C) CLASSIFICATION: 
50 

51 (Viii) ATTORNEY/ AGENT INFORMATION: 

52 (A) NAME: Steffe, Eric K. 

5 3 (B) REGISTRATION NUMBER: 36,688 

54 (C) REFERENCE/DOCKET NUMBER: 1488.0810003 

55 

56 (ix) TELECOMMUNICATION INFORMATION: 

57 (A) TELEPHONE: 202-371-2600 

58 (B) TELEFAX: 202-371-2540 
59 

60 

61 (2) INFORMATION FOR SEQ ID N0:1: 
62 

6 3 (i) SEQUENCE CHARACTERISTICS: 

64 (A) LENGTH: 550 base pairs 

65 (B) TYPE: nucleic acid 

66 (C) STRANDEDNESS: double 

67 (D) TOPOLOGY: both 
68 

6 9 (ii) MOLECULE TYPE: cDNA 
70 

71 

72 (ix) FEATURE: 

7 3 (A) NAME/KEY: CDS 

74 (B) LOCATION: 12,. 392 

75 

76 

77 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

78 

7 9 CACGAGCCAC C ATG GAT GTT TTC AAG AAG GGC TTC TCC ATC GCC AAG AAG 50 

80 Met Asp Val Phe Lys Lys Gly Phe Ser lie Ala Lys Lys 

81 1 5 10 
82 

8 3 GGC GTG GTG GGT GCG GTG GAA AAG ACC AAG CAG GGG GTG ACG GAA GCA 98 

84 Gly Val Val Gly Ala Val Glu Lys Thr Lys Gin Gly Val Thr Glu Ala 

85 15 20 25 
86 

87 GCT GAG AAG ACC AAG GAG GGG GTC ATG TAT GTG GGA GCC AAG ACC AAG 146 

88 Ala Glu Lys Thr Lys Glu Gly Val Met Tyr Val Gly Ala Lys Thr Lys 

89 30 35 ^ 40 45 
90 

91 GAG AAT GTT GTA CAG AGC GTG ACC TCA GTG GCC GAG AAG ACC AAG GAG 194 

92 Glu Asn Val Val Gin Ser Val Thr Ser Val Ala Glu Lys Thr Lys Glu 

93 50 55 60 
94 

95 CAG GCC AAC GCC GTG AGC AAG GCT GTG GTG AGC AGC GTC AAC ACT GTG 24 2 

96 Gin Ala Asn Ala Val Ser Lys Ala Val Val Ser Ser Val Asn Thr Val 

97 65 70 75 
98 

9 9 GCC ACC AAG ACC GTG GAG GAG GCG GAG AAC ATC GCG GTC ACC TCC GGG 2 90 
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PAGE: 3 RAW SEQUENCE LISTING DATE: 08/06/1999 

PATENT APPLICATION US/Omi?, 715A TIME: 16: 1 3:54 

INPUT SET: S32850.raw 

100 Ala Thr Lys Thr Val Glu Glu Ala Glu Asn lie Ala Val Thr Ser Gly 

101 80 85 90 
102 

103 GTG GTG CGC AAG GAG GAC TTG AGG CCA TCT GCC CCC CAA CAG GAG GGT 338 

104 Val Val Arg Lys Glu Asp Leu Arg Pro Ser Ala Pro Gin Gin Glu Gly 

105 95 100 105 
106 

107 GAG GCA TCC AAA GAG AAA GAG GAA GTG GCA GAG GAG GCC CAG AGT GGG 386 

108 Glu Ala Ser Lys Glu Lys Glu Glu Val Ala Glu Glu Ala Gin Ser Gly 

109 110 115 120 125 
110 

111 GGA GAC TAGAGGGCTA CAGGCCAGCG TGGATGACCT GAAGAGCGCT CCTCTGCCTT 442 

112 Gly Asp 
113 

114 

115 GGACACCATC CCCTCCTAGC ACAAGGAGTG CCCGCCTTGA GTGACATGCG GGTGCCCACG 502 
116 

117 CTCCTGCCCT CGTCTCCCTG GACACCCTTG GCCTGTCCAC CTGTGCTG 550 

118 

119 

120 (2) INFORMATION FOR SEQ ID NO: 2: 
121 

122 (i) SEQUENCE CHARACTERISTICS: 

123 (A) LENGTH: 127 amino acids 

124 (B) TYPE: amino acid 

125 (D) TOPOLOGY: linear 
126 

127 (ii) MOLECULE TYPE: protein 

128 

12 9 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

130 

131 Met Asp Val Phe Lys Lys Gly Phe Ser lie Ala Lys Lys Gly Val Val 

132 15 10 15 
133 

134 Gly Ala Val Glu Lys Thr Lys Gin Gly Val Thr Glu Ala Ala Glu Lys 

135 20 25 30 
136 

. 137 Thr Lys Glu Gly Val Met Tyr Val Gly Ala Lys Thr Lys Glu Asn Val 
138 35 40 45 

139 

140 Val Gin Ser Val Thr Ser Val Ala Glu Lys Thr Lys Glu Gin Ala Asn 

141 50 55 60 
142 

143 Ala Val Ser Lys Ala Val Val Ser Ser Val Asn Thr Val Ala Thr Lys 

144 65 70 75 80 
145 

146 Thr Val Glu Glu Ala Glu Asn He Ala Val Thr Ser Gly Val Val Arg 

147 85 90 95 
148 

149 Lys Glu Asp Leu Arg Pro Ser Ala Pro Gin Gin Glu Gly Glu Ala Ser 

150 100 105 110 
151 

152 Lys Glu Lys Glu Glu Val Ala Glu Glu Ala Gin Ser Gly Gly Asp 
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RAW SEQUENCE LISTING DATE: 08/06/ 1 999 

PATENT APPLICATION mi09l017J15A TIME: 16:13:54 

INPUT SET: S32850.mw 

115 120 125 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GGGGATCCAT GTTTTCAAGA AGG 2 3 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
-(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
GGAAGCTTCT AGTCTCCCCC ACTCTGG 27 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
GGGGATCCCG ATGTTTTCAA GAAGG 2 5 
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RAW SEQUENCE LISTING DATE: 08/06/1999 

PATENT APPLICATION US/09/017, 715 A TIME: 16:13:54 

INPUT SET: S32850.raw 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHAEIACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GGGGTACCCT AGTCTCCCCC ACTCTGG 27 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GGGGATCCGC CACCATGTTT TCAAGAAGG 29 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GGGGATCCTC AGAAAGCGTA GTCTGGGACG TCGTATGGGT ACTAGTCTCC CCC ACTCTGG 60 

(2) INFORMATION FOR SEQ ID NO: 9: 



PAGE: 1 SEQUENCE VERIFICATION REPORT DATE: 08/06/1999 

PATENT APPLICATION US/09/017,715A TIME: 16:13:55 

INPUT SET: S32850.mw 



Line Error Original Text 



